Time-frequency distributions for automatic speech recognition
نویسندگان
چکیده
منابع مشابه
Time-frequency distributions for automatic speech recognition
The use of general time-frequency distributions as features for automatic speech recognition (ASR) is discussed in the context of hidden Markov classifiers. Short-time averages of quadratic operators, e.g., energy spectrum, generalized first spectral moments, and short-time averages of the instantaneous frequency, are compared to the standard front end features, and applied to ASR. Theoretical ...
متن کاملOptimizing Time-Frequency Distributions for Automatic Classification
An entirely new set of criteria for the design of kernels (generating functions) for time-frequency representations (TFRs) is presented. These criteria aim only to produce kernels (and thus, TFRs) which will enable more accurate classification. We refer to these kernels, which are optimized to discriminate among several classes of signals, as signal class dependent kernels, or simply class depe...
متن کاملTime-Frequency Features For Speech Recognition
Time-Frequency Features For Speech Recognition by James G. Droppo III Chair of Supervisory Committee Professor Les E. Atlas Electrical Engineering Conventional speaker independent, continuous speech recognition systems are built upon assumptions that are, in general, not met. This dissertation focuses on one deficiency in particular, that the non-stationary speech signal is modeled as a single ...
متن کاملAutomatic Classification of Positive Time- Frequency Distributions
A method of performing automatic classification of positive time-frequency distributions is presented. These distributions are computed via constrained optimization, minimizing the cross-entropy of the distribution subject to a set of constraints. An algorithm for clustering using cross-entropy as the distance measure between vectors was derived by Shore and Gray [14]. We apply this method to t...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Speech and Audio Processing
سال: 2001
ISSN: 1063-6676
DOI: 10.1109/89.905994